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Inhomogeneous temporal processes in natural and social phenomena have been described by bursts 
that are rapidly occurring events within short time-periods alternating with long periods of low 
activity. In addition to the analysis of heavy-tailed interevent time distributions, higher-order 
correlations between interevent times, called correlated bursts, have been studied only recently. As 
the underlying mechanism behind such correlated bursts is far from being fully understood, we 
devise a simple model for correlated bursts using a self-exciting point process with variable range of 
memory. Whether a new event occurs is stochastically determined by a memory function that is the 
sum of decaying memories of past events. In order to incorporate the noise and/or limited memory 
capacity of systems, we apply two memory loss mechanisms: fixed number or variable number of 
memories. By analysis and numerical simulations, we find that too much memory effect may lead to 
a Poissonian process, implying that there exists an intermediate range of memory effect to generate 
correlated bursts comparable to empirical findings. Our conclusions provide deeper understanding 
of how long-range memory affects correlated bursts. 


I. INTRODUCTION 

Many natural phenomena and human activities are 
extremely inhomogeneous in time. Solar flares, earth¬ 
quakes il], firing of neurons [2j, and human communi¬ 
cation [3j are just some examples. In all these phenom¬ 
ena events occurring within short time periods, called 
bursts, are alternating with random, long periods of low 
activity [3]. Often the elements behaving in this manner 
constitute a temporal network [3j and then the processes 
such as spreading on the network are strongly influenced 
by the burstiness of the time series EHS1- Or burstiness 
can be influenced by spreading ms• The natural question 
raises: How to characterize the highly inhomogeneous 
dynamics and how to model it? This is important for 
discovering analogies between different processes leading 
to possible universalities and for understanding the effect 
of temporal inhomogeneities on the network processes. 

At the simplest level, the bursty time series is charac¬ 
terized by the heavy-tailed interevent time distribution 
P(t), where r is the time interval between two consec¬ 
utive events. P(t) has often been described by a power 
law: 

P(T) ~ T~ a . (1) 

However, the interevent time distribution does not pro¬ 
vide a complete characterization. The higher order de¬ 
scription of bursts focuses on dependencies between in¬ 
terevent times, i.e., higher order memory effects. These 
can be approached in different ways. One possibility is 
to calculate the autocorrelation function. For this ap¬ 
proach Goh and Barabasi defined a memory coefficient 
measuring short-range memory effect m as following: 

M = (in - COXa-h - (t))) , 2 , 


where t* denotes the ith interevent time, and (r) and 
a are the average and standard deviation of interevent 
times. The aim was to characterize the bursty time series 
by two quantities, M and the burstiness parameter B , 
defined as 


It was found in many human activities that M is close 
to 0. To describe long-range memory effects, one can 
use the entire autocorrelation function of the time series. 
Recently, it was shown that the power-law decay of the 
autocorrelation function is implied by a power-law in¬ 
terevent time distribution even without any correlations 
between consecutive interevent times. Precisely, the scal¬ 
ing relation a + 7 = 2 was proven with 7 denoting the 
decaying exponent of the autocorrelation function [12| . 
However, more works need to be done for the validity of 
the scaling relation, as the effect of dependencies between 
interevent times in the bursty time series is not yet fully 
understood. 

An approach sensitive to dependencies was recently in¬ 
troduced by using the notion of bursty trains usa. a 
bursty train is defined as a set of events, such that any 
pair of consecutive events in the train is separated by 
an interevent time within a given time window At. The 
distribution of the number E of events in the trains fol¬ 
lows an exponential function if the interevent times are 
independently and identically distributed. It was found, 
however, that in many empirical cases E is power-law 
distributed, i.e., 

P At {E) ~ E~P (4) 

for a wide range of At. This notion of correlated bursts 
was empirically observed in earthquakes, neuronal activ- 
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ities, and human communication patterns m- Such cor¬ 
relations are clearly due to memory effects. 

Generative models for correlated bursts have been de¬ 
vised and studied. Karsai et al. introduced two-state 
model with memory function U3J: One state is for gen¬ 
erating power-law distributed interevent times that are 
uncorrelated, while the other state is for generating short- 
timescale bursty trains. For the latter, they define a 
memory function as the probability of generating one 
more event in the train provided that l events have al¬ 
ready been generated in the train: 



Here a and (3 are directly controlled by parameters for 
memory functions, e.g., /3 = v + 1. In this model, the 
onset of bursty trains is assumed to be known or at least 
declared in order to use the above memory function, re¬ 
quiring additional information. Such assumption is not 
necessarily the case in reality. Thus in this paper we 
suggest more natural and intuitive mechanism for corre¬ 
lated bursts that does not need declaring bursty trains. 
We also investigate the robustness of the scaling relation 
a + 7 = 2 with respect to the strength of dependencies 
between interevent times. 


II. MODEL 

We study a generative model for correlated bursts with 
variable range of memory effect. In our model, bursty 
trains emerge from the stochastic process using a memory 
function. Note that our memory function has nothing to 
do with Eq. ©• At time step t = 1, the first event 
occurs, and the ith event occurs at time step t = ti. The 
probability that the (n + l)th event occurs at time step 
t is given by 

p[m(t)\ = i- e -^W- £ ( 6 ) 

n 1 

m{t) = E 7~r for t > tn ’ ( 7 ) 

i=l 1 u 

where m(t) denotes the memory function that is the 
time-weighted sum of all the past events. Accordingly, 
p controls the strength of memory effect, such that the 
larger p implies the stronger memory effect. Here we 
use e = 10 -6 to indicate spontaneous events taking place 
with very small probability. Once the (n + l)th event oc¬ 
curs, the memory term due to this event is added to the 
memory function. Note that t is discrete and t — ti > 1. 

We remark that our model can be considered as a self¬ 
exciting point process 1 141 - 1161 with a power-law kernel. 
These processes have been extensively studied for earth¬ 
quakes UMB as well as in application to social sys¬ 
tems (22,1123j. In such processes, the time-varying event 
rate is given as a function of the past events. As for 
the kernel, the Omori’s law has been widely used, stat¬ 
ing that aftershock frequency decreases with an elapsed 


time after the main shock, e.g., in a form of (t — ti ) -1-0 
with small positive 6 [18]. The self-exciting point process 
with Omori’s law is called epidemic-type aftershock se¬ 
quence (ETAS) model. Note that our memory function in 
Eq. |7| corresponds to the case with 9 = 0. Analytic and 
numerical approaches to the ETAS model have shown 
that interevent time distributions are mostly described 
by a Gamma function m, implying that a < 1. How¬ 
ever, one finds evidence for a > 1 in many other natural 
and social phenomena [2411261 . Despite the similarity be¬ 
tween our model and the ETAS model, our model shows 
different results such as a > 1 . 

As a novel feature compared to the family of ETAS 
models, we introduce memory loss mechanisms as most 
systems may lose their memory by various reasons, e.g., 
noise, limited memory capacities, or periodic resetting 
in circadian cycles of humans m- We incorporate the 
sequential memory loss mechanism by considering only a 
finite number L of terms in Eq. 0: 


m(t) 


E 

i=n-L -\-1 



for t > t n . 


( 8 ) 


This implies that once an event occurs, the memory due 
to the oldest event is immediately lost. Here L = 1 
implies no memory before the latest event. Note that 
without memory loss, i.e., when L is infinite, m(t) might 
diverge as In t.. 

We can consider more realistic memory loss mecha¬ 
nisms depending on the systems of interest. For example, 
the condition of the fixed L can be relaxed by considering 
variable L. Whenever an event occurs, m(t) is initialized 
except for the latest event, i.e., by setting L = 1, with a 
probability 


q[L(t)] = 1 


' m - 

L[f) + 1 


+ cl, 


(9) 


where Lit) is the number of terms in memory function at 
time t. This can be called preferential memory loss mech¬ 
anism. Here we have introduced the spontaneous initial¬ 
ization of m(t) with very small cl = 10 ~ 6 , otherwise if 
cl = 0 , q(L ) may be extremely small due to extremely 
large L(t ) and vice versa. With this q(L), we expect that 
the distribution of L is proportional to L~ u ~ l e~ L ^ Lc with 
L c = ef 1 . We will study both memory loss mechanisms 
one by one. 

We remark that our model is intrinsically non¬ 
stationary due to the long-range memory effect. How¬ 
ever, non-stationary periods are limited to timescales of 
the order of e -1 , as to be numerically confirmed by the 
decaying behavior of autocorrelation function for the de¬ 
lay time of the order of e -1 . 
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III. RESULTS 


A. Sequential memory loss 


In general, the probability of finding an interevent time 
r between events occurred at t n and t n + t is written as 


P(r) 


~tn +T— 1 

e -nm(t)-e 

l _ e -[im(t n +T)-e 

_i=i„ +1 

L J 


( 10 ) 


This formula is exact as the model is defined in the dis¬ 
crete time t, while the formula for continuous time can 
be found in H23- Let us first consider the simplest case 
when the model has no memory before the latest event, 
i.e., L = 1. Since the distribution does not depend on t n 
but only on t—t n , we set t n = 0 without loss of generality. 
Then, we use 


m(t) = j. ( 11 ) 

The numerical result of m(t) is depicted in Fig. [IJa). This 
can be related to time-varying priority queuing models 
studied in [ 21 ], where the decaying priority of the task 
was considered as ~ t~ a . One gets the interevent time 
distribution: 


P(r) 


JI e-^- e 

_t= 1 



exp [—/iln(r — 1 ) — e(r — 1 )] + e'j 





( 12 ) 


In the last line, we have assumed r 1. This analytic 
result perfectly fits the numerical results even for small 
values of r, see Fig. [2ja). For numerical simulations, we 
have generated the event sequence consisting of up to 10 6 
events using fi = 1/10 for all cases. The bump observed 
for large r is clearly due to the Poisson events with posi¬ 
tive e. We find the power-law exponent of interevent time 
distribution to be 


f 1 + ^ for r <C /rr c , 

( M for M r c t t c . 


(13) 


When t c = 10 6 and /i = 1/10, the scaling regime with 
a = fi turns out to be almost invisible. Thus the domi¬ 
nant scaling behavior is characterized by a = 1 + /i. 

Since all interevent times are fully uncorrelated, the 
bursty train distribution is given by |T3| 

PAt(E) = F(At) E ~ 1 [1 — -F(At)], (14) 

At 

F(At) = £>(r). (15) 

T =1 


For E 1, one 
bursty trains as 


gets the exponential distribution of 


e ~E/E c (At) -^_ e -l/E 0 (At.) 



FIG. 1: (Color online) Temporal evolution of memory func¬ 
tion m(t) in the model with sequential memory loss mecha¬ 
nism, where the number of memories is denoted by L. We 
used = 1/10, and L = 1 (a), 10 (b), and 100 (c). 


with E c (At) = — [lnF(At)] 1 , which is numerically con¬ 
firmed in Fig. [2](b). In case of e = 0, we have E c (At) « 
(A t)» for At » 1 . 

In order to calculate the autocorrelation function, we 
first denote the event sequence by x(t) that has the value 
of 1 at the moment of event occurred, 0 otherwise. The 
autocorrelation function with delay time td is defined as 


A(f \ _ {x{t)x{t + t d )) t - ( x(t)) 2 t 

{d] { X mt - 


(17) 


For the power-law interevent time distribution, one may 
find that A(t d ) ~ t^ 1 . For the uncorrelated interevent 
times, it has been proved that a + 7 = 2 m- This scal¬ 
ing relation is numerically confirmed with the estimated 
value of 7 in Fig. [2](c). 

Next, we consider the case of L = 2, when the memory 
function is composed of two terms corresponding to the 
latest event and the second latest event, respectively. The 
interevent time between those two events is denoted by 
T\. Again we set t n = 0 in Eq. (10 1 . The conditional 
memory function reads 


, , x 1 1 
m{t\T 1 ) = - + —— , 
t t + Ti 


(18) 


leading to the conditional interevent time distribution 
P(t|ti) as 

P{t |n) « e _ ' i/(T|ri)_T/To [/rg(r|ri) + r c _1 ], (19) 

/(t|ti) = lnr + ln(r + Ti)-Inri, ( 20 ) 


PAt(E) 


t t + n 


( 21 ) 
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FIG. 2: (Color online) Interevent time distributions P(r) (left), bursty train distributions PAt(E) (center), and autocorrelation 
functions A(td) (right) in the model with sequential memory loss mechanism, where the number of memories is denoted by L. 
We used (a = 1/10, and L = 1, 2, 10, and 100 (from top to bottom). 


If r <C T \, we get 


which is however not trivial. Instead we find that the 
leading term of Eq. (l22| is not explicitly dependent on 


/ \ xgj co w 4' v | y ° ^ __l: y 1 ^ U1 J uu r u 

P(r|ri) ss + [ — H-) r _ + (22) T h and that T \ appears in Eq. (23) only as a coefficient. 

\ r i T cJ Thus we expect that P(t) ss P(rpr x ) with n in Eq. (19) 


On the other hand, if r n, we get 


P(r|ri) « if 


2^r _(1+2At) + 




o-t/t 


(23) 


Then, P(r) could be obtained by solving the following 
self-consistent equation: 


p ( r ) = 


(24) 


replaced by a crossover interevent time r x ■ We numeri¬ 
cally estimate r x « 70.2 by fitting P(r|r x ) to the simula¬ 
tion result of P(r), shown in Fig. [2jd) . In sum, provided 
that r x <C 2 jxr c <C r c , one can get the power-law expo¬ 
nent: 


a 


1 + fJ for r <C r x , 

1 + 2 n for r x <C t <C 2/rr c , 
2/x for 2^r c -C t <C r c . 


(25) 
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E events 


-——-————- - ** 

TE TE-l • • ' T 1 T 0 


FIG. 3: An event train of E events, where to, te > At and 
Ti < At for i = 1, ■ • • , E — 1. 


The bursty train distribution can be calculated as 


leading to P(r) ~ r~ a with a = 1 + k/j,. Similarly to 
the case of L = 2, one can infer the scaling behavior of 
P(t) ~ r~“ as following: 


' 1 + ^ for t < r x i , 

1 + 2 fi for t x i < r <C r x 2 , 


a = 


1 + i/r f° r r Xi-l -C T <C I//XT C , 

, L/r for L/Ltr c < r < r c . 


(35) 


oo At E—1 

PA t (p)=c y e n ^(pip + o 

To,TB=At+l n,-" ,TB^*=l t=0 

(26) 

with a normalization constant C. An example of event 
train is shown in Fi g. [3| Here we decompose the in¬ 
terevent times in Eq. (26) by assuming that P(rj|T,+i) « 


P(Tj|l), because r»+i = 1 will contribute the most. We 
get 


\E -1 


P At (P) oc F(At\iy 

At 

F{At\T f ) = ^P(r|r'), 


(27) 

(28) 


T=1 


where r' denotes the previous interevent time. This 
approximation is compared to the numerical results in 
Fig. [2](e). In addition, for the autocorrelation function 
we numerically find 7 = 0.81(1) in Fig. [2jf ) that fits 
the scaling relation a + 7 = 2 with a = 1.2 for the 


regime of large r in Eq. (25). It implies that the depen¬ 


dency between consecutive interevent times is not strong 
enough for leading to the violations of the scaling relation 
a + 7 = 2. 

In general, we have L terms in the memory function: 


*(f|{Ti}) = 7 + E 1 


t + Si 


(29) 


where s; = X)j=i T j with tj denoting the time interval 
between the jth latest and (j + l)th latest events. We 
straightforwardly get 


P (.T\{Ti}) W 

e ~,f(r |{n})-r/T e [ w ( T | {Ti } )+7 .-i; 

1. 

(30) 


L_1 / T \ 


/(t|{tJ) = 

,„ T + y>(i + ), 

2=1 x 7 

(31) 

ffH-f+i}) = 

1 L ~ 1 1 
+ E • 

T f-' T + Si 

2 = 1 

(32) 


For s fc _i <C r <C s/c, we get 


fc-i 


f( T \{Ti}) S 

3 k In t — In Si , 

(33) 

9(r\{Ti}) s 

k 1 

s +E 

T Si - S k -1 

i=k 

(34) 


with crossover interevent times r x j for i = 1, ■ • • , L — 1, 
provided that r x i -C ••• -C t x e~ 1 -C Lfir c <C r c . This 
implies that the scaling behavior cannot be described by 
a single value of power-law exponent. We instead calcu¬ 
late the local exponent ai oca i, 


In P(ur) — In P(r) 

^local \T) = , 7 \ , 

In(ur) — in r 


(36) 


with a proper constant u ss 3.3. Indeed, such local expo¬ 
nents show gradually increasing behaviors as r increases 
for the cases of large L, shown in the insets of Fig. §g,j)- 
The bursty train distribution can be written as 


00 At 00 

p &li E) = a Y. E E 

TQ,TE=At + l Ti,”* ,T E - 1 = 1 Te+ l,'” i r E + L- 2 = 1 


E -1 

P(r E ) I] P(n\ T~i+1 )'** j'T’z+L—l)? 
2=0 


(37) 


where C is a normalization constant and te+ 1 , 
te+l- 2 are dummy variables once rs > At. For 
small At, by assuming that P(Ti\r i+1 , - ■ ■ ,r i+ E-i) ~ 
P(rj|l, ,1), one gets P A t(P) being proportional to 
P(At|l, • • • ,1 ) B - 1 with 


At 

F(At|{T'}) = ^P(T|{T'}), (38) 

T=1 

where {r'} denotes the set of L — 1 previous interevent 
times. This result implies the exponential bursty train 
distribution. For large At, we numerically find scaling 
behaviors P A *(P) ~ E~@ with /3 ss 1.55(5) for L = 10 
and 1.46(3) for L = 100, but limited to the range of 
E < L as depicted in Fig. E h ,k). P At (P) has a natural 
exponential cutoff E c m L because L directly controls the 
range of memory effect. The autocorrelation functions for 
general L also show scaling behaviors with 7 « 0.58(1) 
for L = 10 and 0.53(1) for L = 100 in Fig. [2ji,l) . Since 
interevent time distributions are not described by a sin¬ 
gle value of power-law exponent, and the memory effect 
induces dependency between interevent times, we do not 
expect the scaling relation a + 7 = 2 to hold. 

Finally, let us consider the extreme case of L — > 00 . As 
all the past events contribute to the memory function, the 
fluctuation of 771 (f) must be considerably reduced so that 
the system eventually shows the memoryless Poissonian 
behavior, as supported by the decreasing fluctuation of 
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FIG. 4: (Color online) Estimated values of burstiness param¬ 
eter B in Eq. and memory coefficient M in Eq. § for 
different values of L in the model with sequential memory 
loss mechanism. We used /r = 1/10. 


m(t) as L increases in Fig. |T] In order to test our ex¬ 
pectation, we measure burstiness parameter B in Eq. ®> 
and memory coefficient M in Eq. ([2]) for the wide range of 
L. As depicted in Fig. [4j we find that both B and M in¬ 
crease and then decrease with increasing L, implying that 
there exists an optimal range of L (30 < L < 50) maxi¬ 
mizing burstiness and memory effect between interevent 
times. However, such optimal values of L , which play the 
role of cutoff in bursty train distributions, turn out to be 
too small compared to the empirical observations, e.g., 
in [T5] , 


B. Preferential memory loss 

In order to overcome the strong exponential cutoffs due 
to L itself, we study the preferential memory loss mecha¬ 
nism using Eq. ([9]) . The number of past events contribut¬ 
ing to the memory function until a new event occurs is 
now a random variable, denoted by L. The distribution 
of L is given by P(L ) oc L~ v ~ x e~ L ! Lc with L c = ej 1 . 
The interevent time distribution can be obtained from 

OO 

P(t) = Y j Pl(t)P(L), (39) 

L—l 


Pi(t) ~ r~ a with a = 1 + /r = 1.1 from Eq. (13). This 
is consistent with observations that as v increases, 7 ap¬ 
proaches 2 — a = 0.9 and increases considerably, imply¬ 
ing that the bursty train distribution becomes exponen¬ 
tial. The values of burstiness parameter B and memory 
coefficient M get also closer to those for the model with 
sequential memory loss mechanism with L = 1. 

For v < 2, as v approaches 0 from 2, we find that the 
tail of interevent time distribution becomes thinner be¬ 


cause P(t) in Eq. (39) is more influenced by the terms of 
Pl(t) with L > 1, which typically have larger values of 
power-law exponent. This is evidenced by the increasing 
behavior of a from 1.1 to 2 in Fig. [6] a). Since the very 
small v leads to the very small q(L ) in Eq. ([£]), the mem¬ 
ory function is rarely initialized so that some parts of the 
event sequence can be approximated by the model with 
the sequential memory loss mechanism for very large L. 
We indeed observe for u = 0.1 that the event sequence 
is made of dense event clusters spanning relatively long 
periods separated by long interevent times, as partly de¬ 
picted in Fig. [7] In each dense event cluster m(f) overall 
increases, but such non-stationary periods are limited to 
the timescale of the order of e)) 1 = 10 6 . As 771(f) in¬ 
creases but very slowly, it can be roughly approximated 
by a Poissonian process, supported by the estimation of 
7 « 0 and M Ri 0 in Fig. [S] In addition to 7 « 0, the 
autocorrelation function remains finite for wide range of 
t d [see Fig. E i)] because of the non-stationarity in m(t ) 
in dense event clusters. Note that the scaling relation 
a+7 = 2 seems to hold for v = 0.1 even when (3 ~ 1.5 and 
B ss 1, implying strong dependency between interevent 
times and strong burstiness effect. These can be under¬ 
stood as follows. As bursty trains are mostly measured 
in dense event clusters, they tend to contain more events, 
leading to the heavier tail for bursty train distributions 
and the smaller value of /3, i.e., [3 ~ 1.5. Relatively few 
but very large interevent times separating dense event 
clusters force B to get close to 1. 


IV. CONCLUSIONS 


where Pl(t) denotes the interevent time distribution for 
fixed L in the model with sequential memory loss mech¬ 
anism, i.e., Eq. ( [30| but with {t;} replaced by {r X j}. 
Numerical results for [i = 1/10 and for several values of 
v are shown in Fig. [5j and the estimated values of a, /3, 
and 7 as functions of v are plotted in Fig. [6][a,b). As 
v increases, a monotonically decreases, while (3 and 7 
monotonically increase. The value of f3 turns out to be 
larger than that of v + 1. Since E c ss L for each L, it is 
expected that /3 > v + 1. We also find that the scaling 
behavior in bursty train distributions is more robust with 
respect to the value of At, hence comparable to the em¬ 
pirical observations. The empirical values of power-law 
exponents are plotted in Fig. [b](a,b) for comparison i T3] , 
If v is sufficiently large, i.e., v > 2, the term for 
L = 1 becomes dominant in Eq. ([39]) , leading to P(t) ~ 


In order to investigate the underlying mechanism be¬ 
hind correlated bursts widely observed in natural phe¬ 
nomena and human activities, we have devised and stud¬ 
ied a simple model that is able to generate correlated 
bursts using a self-exciting point process with variable 
range of memory. Our model does not need to declare 
the bursty trains as compared to the previous two-state 
model for correlated bursts. In our model, a new event 
can occur depending on the memory function defined as 
the sum of decaying memories of past events. For in¬ 
corporating noise and/or the limited memory capacity of 
systems, we apply two different memory loss mechanisms: 
fixed number or variable number of memories, which we 
call sequential and preferential memory loss mechanisms, 
respectively. For each case, we obtain the interevent time 
distribution, bursty train distribution, and autocorrela- 





7 






FIG. 5: (Color online) Interevent time distributions P(t) (left), bursty train distributions PAt{E) (center), autocorrelation 
functions A(t c i) (right) in the model with preferential memory loss mechanism. We used n = 1/10 and v = 3, 1, and 0.1 (from 
top to bottom). The value of /3 was measured for At = 1024 in all cases. 





FIG. 6 : (Color online) Estimated values of a, /?, and 7 (a,b), and B and M (c) for different values of v in the model with 
preferential memory loss mechanism. We used p = 1/10. The value of /3 was measured for At = 1024 in all cases. We also 
plot the empirical values of a, /3 (filled symbols), and 7 (empty symbol) for neuron firings (diamond), earthquakes in Japan 
(inverse triangle), and mobile calls (square) [13) . 


tion function, all of which are characterized by power-law 
decaying exponents a, /3, and 7 , respectively, to study 
scaling relations among them. 

For the model with sequential memory loss mechanism, 
the memory function is given by the sum of decaying 
memories of L latest events, where L is a control param¬ 
eter. The simplest case with L = 1 has been exactly 


solved, also satisfying the scaling relation a + 7 = 2 [ 12 ]. 
Other simple cases could be analytically solved, while the 
general cases have been numerically studied. As L be¬ 
comes larger, the bursty train distribution shows scaling 
behavior for limited range of parameters, implying the 
emergence of correlated bursts. However, the number of 
events in bursty trains is strongly limited by L. Interest- 
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FIG. 7: (Color online) Temporal evolution of memory func¬ 
tion m(t) in the model with preferential memory loss mecha¬ 
nism. We used p, = 1/10 and v = 0.1. 

ingly, if L is extremely large, too much memory effect ef¬ 
fectively reduces the model to the Poisson process, which 
is confirmed by both memory coefficient and burstiness 
parameter approaching 0, i.e., no memory effect and no 
burstiness. 

In order to overcome the strong cutoff effect due to 
the fixed L, we have numerically studied the model with 
preferential memory loss mechanism. Here the number 
of memories L in the memory function increases gradu¬ 
ally but is set as 1, i.e., memory function initialization, 
with probability controlled by the exponent v. For suf¬ 
ficiently large v , the memory function is initialized fre¬ 
quently so that the model can reduce to the case with 
sequential memory loss mechanism using L = 1. On 
the other hand, for very small iz, the event sequence is 


composed of dense event clusters spanning long periods 
that are separated by very large interevent times. Dense 
event clusters may correspond to the case with sequen¬ 
tial memory loss mechanism using very large L, i.e., close 
to the Poisson process. For the intermediate range of v , 
we find evidences that our model generates correlated 
bursts, hence comparable to the empirical findings. 

As a followup, our models can be extended to incor¬ 
porate a number of complex realistic situations. For ex¬ 
ample, we can consider the context of events [29] , and a 
network of interacting individuals, each of which shows 
activities with correlated bursts. 
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